yzBigData: Provisioning Customizable Solution for Big Data

نویسندگان

  • Sai Wu
  • Gang Chen
  • Ke Chen
  • Lidan Shou
  • Hui Cao
  • He Bai
چکیده

YZStack is our developing solution which implements many wellestablished big data techniques as selectable modules and allows users to customize their systems as a process of module selection. In particular, it includes an openstack based IaaS (Infrastructure as a Service) layer, a distributed file system based DaaS (Data as a Service) layer, a PaaS (Platform as a Service) layer equipped with parallel processing techniques and a SaaS (Software as a Service) layer with popular data analytic algorithms. Layers of YZStack are loosely connected, so that customization of one layer does not affect the other layers and their interactions. In this paper, we use a smart financial system developed for the Zhejiang Provincial Department of Finance to demonstrate how to leverage YZStack to speed up the implementation of big data system. We also introduce two popular applications of the financial system, economic prediction and detection of improper payment.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

When to use 3D Die-Stacked Memory for Bandwidth-Constrained Big Data Workloads

Response time requirements for big data processing systems are shrinking. To meet this strict response time requirement, many big data systems store all or most of their data in main memory to reduce the access latency. Main memory capacities have grown, and systems with 2 TB of main memory capacity available today. However, the rate at which processors can access this data—the memory bandwidth...

متن کامل

Optical networks for cost-efficient and scalable provisioning of big data traffic

This article shows how recent advances in optical networks can be utilized to improve big data processing by cost effective and scalable provisioning of high-bandwidth connectivity for big data traffic in backbone networks and consequently tackle the current problems related to big data processing in distributed environment including cloud computing. We focus on two optical technologies, namely...

متن کامل

CSPE: Cloud Storage Provisioning Decided by Rate of Return and Workload Characteristics

As recent report [1] claims, the capacity of digital content on the Internet has amounted to 500 billion GB. What is more, this number is estimated to be double in next year. The emerging of cloud computing offers a rather feasible solution to the problem of information explosion. Thus, for those IT enterprises with high demand of storage, a big concern is to determine whether it is cost effect...

متن کامل

Cloud Template, a Big Data Solution

Today cloud computing has become as a new concept for hosting and delivering different services over the Internet for big data solutions. Cloud computing is attractive to different business owners of both small and enterprise as it eliminates the requirement for users to plan ahead for provisioning, and allows enterprises to start from the small and increase resources only when there is a rise ...

متن کامل

Digital still cameras and mobile agents: How to create a distributed service for image processing

The new distributed multimedia applications require more and more to manage user’s mobility. The opportunity of accessing the data at any time, from any place, and with terminals having several processing capabilities, is one of the most important features required. Adequate mechanisms need therefore to be developed, in order to manage the user’s mobility and the distributed processing of data ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • PVLDB

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2014